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ABSTRACT 

Due to recent availability of large surveys, there is renewed interest in third- 
order correlation statistics. Measures of third-order clustering are sensitive to 
the structure of filaments and voids in the universe and are useful for studying 
large-scale structure. Thus statistics of these third-order measures can be used 
to test and constrain parameters in cosmological models. 

Third-order measures such as the three-point correlation function are now 
commonly estimated for galaxy surveys. Studies on third-order clustering of ab- 
sorption systems will complement these analyses. We define a statistic, which we 
denote as /C, that measures third-order clustering of a dataset of point observa- 
tions, and focus on the estimation of this statistic for an absorber catalog. The 
statistic /C can be considered a third-order version of the second-order Ripley's 
K function and allows one to study the abundance of various configurations of 
point triplets. In particular configurations consisting of point triplets that lie 
close to a straight line can be examined. 

Studying third-order clustering of absorbers requires consideration of the ab- 
sorbers as a three-dimensional process, observed on Quasi-stellar object (QSO) 
lines of sight that extend radially in three-dimensional space from the Earth. 
Since most of this three-dimensional space is not probed by the lines of sight, 
edge corrections become important. We use an analytical form of edge correction 
weights and construct an estimator of the statistic /C for use with an absorber 
catalog. We show that with these weights, ratio-unbiased estimates of JC can be 
obtained. Results from a simulation study also verify unbiasedness and provide 
information on the decrease of standard errors with increasing number of lines of 
sight. 

Subject headings: cosmology:large-scale structure of universe, methods: statistical 
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Introduction 



The C IV and Mg ll Quasi-stellar object (QSO) absorption systems or absorbers appear 



fCrotts 


1985: 


Crotts et al. 


1985 



Tytler et al.l Il993l ) . See iTripp &: BowenI (120051 ) for a discussion on the connection between 
galaxies and QSO absorbers using observations at low redshift. 



The clustering of such absorbers were studied in a series of i nvestigations (jVanden Berk et al 



19961 : iQuashnock et al.lll996l : iQuashnock fc Vanden Berklll998l ) using an extensive catalog of 
absorbers drawn from the literature. Specifically, they performed a second-order correlation 
analysis in one dimension, restricting to absorber pairs lying on the same QSO lines of sight, 
and found clustering on very large sca les, up to 50 to 100 /;- ~^Mpc. This supercluster ing has 
also been found in other studies, e.g. iHeisler et al.l (119891 ): iDinshaw fc Impeyl (119961 ). This 
work on the second-order clustering of absorbers complements the analyses of second-order 
structure of other astronomical objects such as galaxies, quasars and the cosmic microwave 
background. 

In ast ronomy, a common measure of second-order structure is the two-point correlation 
function ^ JPeebleslll98ol . ll993h . and this is the function used in the above-mentioned studies. 
Another measure of second -order cluste r ing is the reduced seco nd moment function, also 
called Ripley's K function (jRipleylll988l : iMartmez fc Saarll2002l ). In three dimensions, the 
K function is related to the two-point correlation function by 

K{r) =Att u^[l + ^{u)]du. 



Martinez et al.l (119981) ap plied the K function to galaxy surve ys, while lOuashnock fc Stein 
( 1999 ): Stein et al. (2000) used it to examine clustering of the Vanden Berk et al. ( 19961 ) C 
IV absorber catalog. 



Loh et al.l (1200 ll ) e xtended the work of 



Ouashnock &: SteinI (119991 ) in the study of second- 



order clustering of the IVanden Berk et al.l (Il996l ) absorber catalog by considering the ab- 
sorbers as a process occurring in three dimensions. By treating the absorbers as a three- 
dimensional process, absorber pairs that lie on different lines of sight were included in esti- 
mates of the K function. As a result, the estimates obtained were shown to have dramatically 
smaller standard errors than estimates obtained by only considering the absorbers as a one- 
dimensional process on the lines of sight, when there is a large enough number of lines of 
sight. 



More recently, there has been interest in higher-order clustering, in particular in third- 
order clustering, partly because of limitations of restricting to second moments and partly 
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because datasets are now large enough for third-order statistics to be estimated. In par- 
ticular, the structure of filaments and voids that is present in ga laxy surveys is more read- 
ily, though still inadequately, describ ed by third-order stat i stics (IGaztanaga fc Scoccimarro 



20051: 



Nichol et al. 



2006h: 



Sefusatti &: Scoccimarrol [20051 ) . See Jing fc Borner ( 1998 ): Gaztanaga et al. (2005); 



Kul 



Karni 



, i et al. J2007h for some examples of the three-point corr e lation 

function ( Fry fc PeeblesI ~98ol ) applied to galaxy surveys. The Vanden Berk et al. ( 1996 ) 
absorber catalog, which has 276 lines of sight and 345 C iv absorbers, is too small for in- 
vestigat ing third-order st ructure, but the catalog being gathered by the Sloan Digital Sky 
Survey (lYork et al.ll2000l ) will have many more absorbers and lines of sight, making a study 
of the third-order structure of absorbers feasible. 

Here, we are concerned with estimating the third-order structure of an absorber catalog. 
With estimates describing the third-order structure of an absorber catalog, one can compare 
these estimates with corresponding estimates from galaxy surveys. For example, one can 
study whether absorbers lie along filaments like galaxies do. 

In Section [3] we will define a third-order version of the K function and relate it to the 
three-point correlation function. We provide edge correction weights for its estimation for 
an absorber catalog. We provide mathematical details in the Appendix and results of a 
simulation study (Section Hj) to show that these weights do p roperly account f or the edge 
effects. Since these weights make use of the weights found in iLoh et al.l (l200ll ). we briefly 
summarize their method of finding correction weights (Section [2]). Section O contains a brief 
summary and discussion of the application of this work for studying galaxy clustering and 
large-scale structure. 



2. Estimation of K for an absorber catalog 



Here we briefly describe the method of iLoh et al.l (120011 ) for finding correction weights 



for estimating the K function from an absorber catalog. 

Figured] is a schematic diagram that shows absorbers lying on some lines of sight. The 
solid lines represent lines of sight, and the solid circles, absorbers at x and y. The dashed 
circles represent a shell centered at x with radius |y — x| and thickness du, which we will 
denote by 5Bdu{^i |y ~ This shell has volume 47r|y — xprf-u. The point b represents an 
intersection point of 5i?o(x, |y — x|) and L, the set of lines of sight. Note that with regards 
to notation, in this paper we will use x, y, z to represent locations of absorbers on L and 
a, b, c to refer to general locations on L which may or may not have absorbers present. 
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du 

Fig. 1. — A schematic diagram showing absorbers at x and y. The sohd hnes represent 
hnes of sight, and the dashed hnes the shell centered at x, with radius |y — x| and thickness 
du. Each of the shaded rectangles represents cylinders in three-dimensional space and shows 
where an absorber center must lie in order to be detected at that particular location on a 
line of sight. 
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The two shaded rectangles on this shell represent cylinders in three-dimensional space. 
If the center of the absorber y lies in a cylinder, it will be observed on the line of sight that 
passes through the center of that cylinder. When the absorber at x is at the center, the 
contribution of the absorber at y to the estimate of K needs to be corrected for boundary 
effects, i.e. for the points in the shell SBdui^, |y — x|) not probed by the lines of sight so 
that an absorber could be present there but not observable. Using Ripley's method of edge 
correction, the weight is the reciprocal of the probability of detecting an absorber, given that 
the absorber is present in the shell. This weight is approximated by the ratio of the volume 
of the shell 5i?d„(x, |y — x|) to the volume of the cylinders. Specifically, the weight is given 
by 

47r|y — xp 



47r|y — xpc/w 4 



(1) 



where /x,y is the set of points in 5i?o(x, |y — x|) fl L, and 9p is the angle subtended by the 
line of sight that point p lies on, and the line joining point p and absorber x. In Figure [H 
/x,y just consists of the points y and b. The angles 9y and 9^ are also indicated in Figure 
[H The variable d is the radius of an absorber in comoving units, and we assume it to be 
unknown, but fixed. The value of d does not need to be specified because it gets cancelled 
away and does not appear in the estimator for K. For more details, see Section [3] where this 
cancellation also occurs for the estimator of the third-order statistic /C. 



With weights specified by ([T]), iLoh et al.l (120011 ) show that estimates for X^K are unbi- 
ased. 



3. Estimating third-order statistics for the absorber catalog 



It is well-known that second-order st atistics do not complet e ly de scribe the clustering 
properties of point processes. For example, iBaddeley fc SilvermanI (Il984j ) provide an example 
of a non-Poisson process with the K function identical to that of a homogeneous Poisson 
process. Third- and higher-order statistics will allow a more detailed study of clustering 
than just second-order statistics. 



Peebles &: GrothI (119751 ) defined a three-point correlation function ( and applied it to 



the Zwicky catalog. Wit h three volume elements dVi, dV2 and dV^ that define a triangle with 
lengths ri2,r23 and ris, [Peebles fc GrothI (119751 ) wrote the probability of finding an object 
in each of these elements as 



dP = \^[l + ^in2)+ar23)+^ins)+ari2,r23,ru)]dVidV2dV3, (2) 
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where A is the intensity or number density of the point process. Subsequent studies using the 
three-point correlation function frequently used a different description of the configuration 
of triplets, employing two distance measures and one angle measure: s = r^^?,,q = r^.^/ri^ an d 



9, the angle between r^2 snad r23. See, for example, iGaztanaga et al.l (120051 ): iNichol et al. 



(120061 ): iKulkarni et al.l (120071 ). Note that in the above notation the angle is subtended at the 
second point. We will use the parametrization of two distances and an angle in this work. 

For the study of galaxy surveys, a related quantity Q, called the reduced three-point 
correlation function is often used, where 



as, q, 9) = Q{s, q, 9) X [e(ri2)e(r23) + e(r23)e(ri3) + e(ri3)e(ri2)]. 



(3) 



The hierarchical form of ([3]) was proposed in Peebles &: Groth ( IQTsI ) based on their anal- 
yses of the Lick and Zwicky catalogs. It is an empirical form witho ut theoretical suppor t 
(jjing fc Bornerlll998l ). but has been found to hold in other studies e.g. ISzapudi et al.l (l200ll ). 
In analyses of the 2dFGRS and SPSS galaxy survey s, there appears to be variation of Q 



with 9 (e.g. iGaztafiaga et al.ll2005l : iNichol et al.l 120061 ) 



In the statistics literature, the quantity dP in 
terms of a function g^^^: 

^3^(3) 



above is more commonly expressed in 



dP = X'g^'\ru,r2s,9)dVidV2dV3, 



(4) 



where we have used two distances and an angle for the paramete r s of q ^^\ lM0ller et al. 
(119981 ) refer to X^g^^^ as the third-product density. lM0ller et al.l (119981 ) also designed a 
third-order sta tistic to distinguish between certain classes of point process models, while 
HanischI (Il983l) used a third-order s tatistic to examine inner linearities in point patterns. 
See also ISchladitz fc Baddeleyl (120001 ). These third-order statistics are integrated versions of 
the third-product density, and can be considered third-order versions of the second-order K 
function. For our purposes, we define such a third-order function, which we denote by JC: 



/C((0,i?i],(i?2,i?3],f^) 



R2 Jn 



4nrl^rl^g^^\ru,r23,9){2'Ksm9)d9dr23dri2, (5) 



where (a, b] denotes an interval that includes b b ut not a and Q^lai, a2],0 < ai < a2 < n 
is a range of angles. The third-order statistic of iM^ller et al.l (119981 ) corresponds to JC with 
i?2 = 0,i?3 = Ri and fl = [0,7r]. The quantity /C((0, (i?2, -Rs], ^^) has an intuitive 
interpretation: given a randomly chosen object at x, A^/C((0, -Ri], (i?2, R3], ^) is the expected 
number of object pairs at y and z such that |y — x| G (0,i?i], |z — x| G {R2,R3] and the 
angle subtended by y and z at x, Zyxz, is in Q. To relate to the notation in note that 
^12 = |y ~ x| and r23 = |z — x|, so that x is point 2, at which the angle is subtended. Of 
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particular interest is the case when 6 is close to or tt, since this describes the property of 
finding triplets of points that lie close to a line. We will also be interested in the variation 
of K, with VL. 

Both the three-point and reduced three-point correlation functions can be obtained from 
/C. Consider /C((0, -Ri], (0, -R3], fi) where f2 is a small angle range, {9 — 59/2, 9 + 59/2), say. 
Then from ([5]) we have, 



dRidR^ 



47iS{Q)RiRlg^^\Ri,R: 



3,^ 



(6) 



where S{Q) is the solid angle formed by the part of the unit sphere that subtends an angle 
6' G to the X-axis. Using ([2]), ([3]) and (jl]), ( and thus Q can be expressed in terms of g^^\ 
Therefore estimates for one quantity can be converted to estimates for the other quantities. 

There are different advantages to estimating /C versus (. Since K. is an integral quantity, 
it is often smoother and thus its estimates might have better theoretical properties. It also 
separates the choice of bin size from the edge correction weights, so if a study of the effect 
of bin s ize or of different edge correction methods is desired, it may be more appropriate to 
use /C (IStein et al.l 120001 ). On the other hand, the hierarchical form of ([3]) is more simply 
expressed using ( and Q. Having estimates of /C, ( and Q allows for more flexibility in 
studying the clustering present in a dataset, so rather than advocating for one statistic over 
another, we recommend using all these statistics as tools for a detailed analysis. 

When studying clustering of a point pattern observed in a flnite region, it is important 
to account for the boundary of the observation region. If a point falls near a boundary, we 
do not get to observe all its neighboring points. This is a particularly important issue for an 
absorber catalog, since only a small portion of the three-dimensional space is probed by the 
lines of sight. In order to obtain unbiased estimates, point pairs that are observed have to 
be reweighted to account for the boundary effect. There are various methods to do the edge 
correction. These can be numerical suc h as in the estimators of th e two-point correlation 
function introduced bv [Payis fc Peebles ( 1983) and Hamilton ( 1993 ). or an alytical such as 
those introduced by Ripley ( 19881 ) and" ohser (1983) for the K function. See Kerscher et al. 
(I2OOOI ) for a good review in the astronomy context. 



Loh et a 



(I2OOII) foun d correction weights, based on the same correction procedure 



suggested by iRipleyl (119881). for est i matin g the K function for the IVanden Berk et al.l (119961 ) 
C IV absorber catalog. iLoh et al.l (120031 ) found expressions of the cor r ection weigh ts based 
on Ohser's and Stoyan's correction methods (lOhser fc StoyanI Il98ll : lOhseii Il983l ). Here, 
we obtain edge correction weights for estimating the third-order moment function /C using 
Ripley's method. 



- 8 - 



Note that since estimating a third-order statistic involves counting triplets of points, the 
more common analysis approach of treating the absorbers as a one-dimensional process on 
the lines of sight cannot be used. The absorbers have to be treated as a three-dimensional 
process, and the edge effects caused by the large regions of unobserved space have to be 
accounted for. 

For fixed values of i?2, -R3, «i and 02, we estimate /C by first estimating X^AK. with 
l(o,fli](|y - x|)l(ij2,R3](|z - x|)lf^(Zyxz)^x(|y - x|, |z - x|, fi)r(|y - x|, |z - x|, fi), (7) 

and then dividing the estimate X^AJC by an estimator of X^A. Here, L is the set of lines of 
sight, A = Trd'^\L\ is the volume probed by the lines of sight, d is the constant radius of an 
absorber in comoving units, Zyxz is the angle subtended by y and z at x, and for any set 
S, ls{u) is an indicator function, equal to 1 if m G S" and otherwise. 



We find that with 



^x(|y 



x|,n) 



xPlz 



{-Kd^fiduY EpG/x,y ^qe/x,z If^(^Pxq)/ (cos^p COS^q) 



47rS'(n)|y-x|2|z-x| 



V^(|y-x|,|z-x|,^]) 



i.T'd^Y ^pei.,y Eqe/x,. ln(^pxq)/(cos Op cos ^q) ' 
A \L\ 



TTd'^\L{\y — x|, |z — x|, n)| |L(|y — x|, |z — x|, f2) I 



(9) 



the estimator in ([7]) is unbiased for X^AJC. The quantity Wx(|y — x;|, |z — x|,f2) is the 
correction weight nee ded to account for the edge effects. It is also called the local weight in 



Kerscher et a 



(120001) . The quantity V(|y — x|, |z — x|, Q) is sometimes referred to as Ohser's 



factor (lOhse and makes the estimator valid for longer distances. It is called a global 



weight in lKerscher et al.l (I2OOOI ). For the rest of this section, we explain how the expressions 
in (IHl) and (l9l) are obtained. 



The denominator in the right-hand side of (IHl) is related to the weights found in lLoh et al. 



(120011 ) , specifically, to the denominator in the right-hand side of ([T]) . In ([T]) , the denominator 
is the sum of the volumes of the cylinders associated with the intersection of 5-Brfu(x, |y — x|) 
with the set of lines of sight L, less a factor of du. These cylinders are shown in Figure [T] 
and are shown again in the lower left portion of Figure O 
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du 

Fig. 2. — A schematic diagram similar to Figure [1], with an additional absorber at z, and 
a shell centered at x, with radius |z — x| and thickness du. The triplet of absorbers y,x, z 
is of the desired configuration. The triplet of points b, x, c corresponds to a set of locations 
in L where a triplet of absorbers of the desired configuration could potentially have been 
observed. 
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For ([8]), we need to consider the intersections of 5Bdu{'^, |z — x|) with L as well, rep- 
resented by the outer shell in Figure [2l Like in ([1]), /x,y = 6Bq{-x, |y — x|) fl L where 
5i?o(x, |y — x|) is the sphere centered at x with radius |y — x|. The definition for /x,z is 
similar. With respect to Figure /x,y contains the locations b and y, while /x,z contains 
the locations c and z. 

To get the denominator on the right-hand side of ([8]), we consider pairs of cylinders, 
one on the outer shell and one on the inner shell, i.e. a cylinder associated with a point 
q in /x,z and another associated with a point p in Jx,y. Each product of the volumes of 
these pairs of cylinders, equal to ndP^du)/ cosO-p x ncP{du)/ cos9q, is included in the sum 
only if the angle subtended at x by the centers of the cylinder pair is in the range specified 
by Q, i.e. if In(Zpxq) = 1. In Figure these pairs are highlighted by rectangles that are 
similarly shaded. Note that the (du)"^ term cancels because there is a corresponding term in 
the numerator of ([8]). It is also worth noting that the numerator of ([8]) has a form similar to 
the right-hand side of IQ. 

There may be locations in L that cannot be a possible location for the absorber x of a 
triplet y, x, z of the desired configuration. Which locations these are depend on the actual 
positions and lengths of the lines of sight in L. The quantity in ([9]) accounts for this. 
Each location a G L is in the set L(|y — x|, |z — x|, fi) if there are points b, c G L such that 
|b — a| = |y — x|, |c — a| = |z — x| and the angle subtended by b and c at a, Zbac, is in Q, 
i.e. Ldy — x|, |z — x|, fi) is just the set {a G L : 3b, c G L with |b — a| = |y — x|, |c — a| = 
|z — x|, Zbac G So, by definition, x of Figure [2] has to be in L(|y — x|, |z — x|,fi). 

To get an estimate of /C, we divide the estimator ([7]) by an estimate of X^A, e.g. 
{N^/A^)A = A^^/(7r(i^)^|Lp. Thus, although the expression for cjx includes a (vrrf^)^ term, 
the value of d need not be specified when estimating /C since it gets cancelled away by the 
same term in the estimate of X^A. 

The proof of unbiasedness is provided in the Appendix. Note that it is the estimator 
of X^AK, that is unbiased. The estimate /C that is obtained by dividing by an estimate of 
X^A may be slightly biased. Such a property is called ratio-unbiasedness, and is a feature of 
estimators of the second-order K function as well. 



4. Simulation Study 

We ran a simulation study to explore the performance of the estimator given in ([7j), with 
weights given in ([8]) and (Q. Note that distances referred to here are comoving distances. 
We first generated a set of 1000 lines of sight in a region similar to that to be probed by 
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the QSO lines of sight of the SDSS Catalog: a cone with half-angle of 45° with Earth at 
its tip, bounded by comoving distance 2000 < r < 3300/i~^ Mpc from Earth. This range of 
distances corresponds to the comoving distances probed by QSO lines of sight for Mg ll and 
C IV absorbers under the Einstein-de Sitter cosmology. A thousand realizations of a Poisson 
poi nt process are then s i mulat ed on to these lines of sight, with density equal to that found in 



the lVanden Berk et al.l (Il996l ) catalog, 0.004 per Mpc. We chose Poisson processes since 
the theoretical value of /C is known for the Poisson model: P = /CpQi((0, Ri], {R2, R3], fl) = 
4:TtS{Q){RI — -R|)-Rf/9. For each realization, we estimate the third-order function /C. We 
then find the mean and variance of these estimates, and compare it with the theoretical 
Poisson value P. The results are shown in Figures [3] and |H 

Figure [3] shows the ratio of the mean estimates of /C((0, 50], (250, r], Q) to the expected 
Poisson value, for = (0°,5°) (top left), n = (40°, 50°) (top right), n = (55°, 60°) (bottom 
left) and Q = (80°, 90°) (bottom right), plotted as a function of r, for 250 < r < 330 h'^ 
Mpc (solid lines) . The dashed lines show the pointwise error, equal to two times the standard 
deviation of the 1000 estimates. Notice that in each case, the true value of 1 lies within this 
band. Furthermore the mean estimated value is very close to 1 for the smaller angle ranges, 
with a slight bias appearing with angles close to 90°. We believe this is because the edge 
correction approximation becomes less accurate at angles close to 90°. 

Figure H] shows plots of the same ratio as a function of 6, the midpoint of Q, from 0° 
to 90°, for values of r fixed at 260, 280, 300 and 320 Mpc. The angular bin size used is 
10°. Again, we find that the pointwise confidence band contains the true value 1, with the 
mean value also close to 1. These plots show the bias appearing as the angle is increased to 
90°, with this bias becoming slightly less as the range (250, r] increases. 
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Fig. 3. — Plots of the mean, over 1000 simulated realizations, of the ratio of 
/C((0, 50], (250,r],Q) to the expected Poisson value as a function of r from 250 to 330 h'^ 
Mpc. The Q for each plot is specified at the top of plot. The dashed lines refer to pointwise 
errors that are twice the standard deviation obtained from the simulations. 



r= 260 



r= 280 




Fig. 4. — Plots of the mean, over 1000 simulated realizations, of the ratio of 
/C((0, 50], (250, r], to the expected Poisson value as a function of 6 from to 90°, us- 
ing angular bins of 10°. The value of r for each plot is specified at the top of plot. The 
dashed lines refer to pointwise errors that are twice the standard deviation obtained from 
the simulations. 
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We chose the distance 50 Mpc for since it correspo nds roughly to the scale of 
superclustering that has been detected (iQuashnock et al.lll996l ). With the values of G 
(250, 3 30] that we use d , the ratio r23/ri2 is then about 5 or 6, close to the values considered 



in e.g. iKulkarni et al.l (120071 ). although the values of ri2 considered there are much smaller. 
We also considered values of 10 to 40 Mpc for Ri. The results are qualitatively similar 
to the results for Ri = 50h~^ Mpc, except with slightly larger standard errors. 

We also performed a simulation study using 10,000 lines of sight in the same region. The 
corresponding plots are similar to those of Figures [3] and HJ but with standard errors smaller 
by a factor of about 10 to 15, i.e. roughly of the order of the increase in the number of lines 
of sight. For fl = (0°,5°), standard errors dropped only by about a factor of 5, however. 
This approximat e relation between standard errors and the number of lines of sight is similar 
to that found bv lhoh et aP fl200lh . 



5. Discussion and Conclusion 

Measures of the third-order clustering of galaxy surveys and the cosmic microwave 
background are useful for the additional information they provide over measures of second- 
order clustering. In particular, the filamentary structure that has been found in galaxy data 
is more readily described by third- and higher-order measures of clustering. Recently, due 
to the availability of larger datasets and advances in computing, such study of higher-order 
clustering has been the subject of active research. 

It will be desirable to study the third-order clustering of absorption systems. Absorbers 
are often detected at extreme comoving distances from the Earth. Since absorbers are 
believed to be due to gas clouds near galaxies, an analysis of the third-order clustering of 
absorbers can serve as a complementary analysis to that of large galaxy surveys, enabling 
comparison of the local filamentary structure to that of the early universe. Studying the 
third-order structure of absorbers might also provide greater understanding of their nature. 
Finally, absorption systems consist of non-luminous matter. Understanding the clustering 
of absorbers can yield insight into the link between luminous and non-luminous matter in 
the universe. 

In this paper, we define a third-order moment function /C that is an integrated version of 
the three-point correlation function, much like the relation between the second-order Ripley's 
K function and the two-point correlation function. We provide expressions for the weights 
necessary to correct for the boundary effects so that this function can be estimated for an 
absorber catalog. Our simulation study shows that the estimator gives correct results (i.e. 
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including correctly accounting for the boundary effects), at least for the theoretically simple 
Poisson process. 

Studies on large-scale structure with gala xy surveys have shown the existence of struc- 
tures of the order of 100 h^^ Mpc in size (IKirshner et al.l Il98ll : iGeller fc Huchral Il989 



da Costa et al. Il994r). In a nalys es of second-order clustering of the Las Campanas and 



SDSS surveys, iLandy et al.l fl 19961) and lEisenstein et al.l (120051) respectively found peaks on 



scales of around 100 Mpc. iQuashnock et al.l (119961 ) also found evidence o f sup e r clus- 
tering on these scales in their analysis of C IV absorption systems. iLoh et al.l (l200ll ) also 
found evidence of clustering up to 100 Mpc and poss i bly beyond. Studies on g alaxy 
clustering have focused on smaller scales. iGaztafiaga et al.l (I20051): Nichol et al.l (120061 ) and 
Kulkarni et al. ( 2007 ) followed the example of Ijing &: Bornei ( 1998 ). using ri2 from 1 to 10 
Mpc and r2z/ri2 between 1 and 4, and studied the variation in the reduced three-point 
correlation function Q with angle. 

The choice of Ri,R2,R3 and Q for / C would thus d e pend on the aim of the analysis. 
For comparisons with the findings of e.g. iJing fc Borneii (119981 ). the focus will be to study 
the variation of /C with Q, with the distance measures close to the values used there. For 
studies on superclustering and large-scale structure, comoving distances of 100 Mpc and 
beyond for one or both of ru and will be of interest. An initial study will probably use 
Q = [0,7r], studying clustering at various distances, followed by more detailed analyses with 
smaller angular ranges. 

We are not aware of any other work on estimating third-order clustering specifically 
for absorber catalogs. Unfortunately, we do not have a large enough catalog of absorbers 
to obtain meaningful estimates of the third-order function. With the much larger absorber 
catalog that is being collected by the Sloan Digital Sky Survey, a detailed study of the third- 
order clustering of absorbers will become feasible. From our simulation studies, we found 
that the standard errors of estimates of /C for an absorber catalog scale roughly o n the order 



of the reciprocal of the number of lines of sight. This agrees with the findings of iLoh et al. 



(I2OOII ) for standard errors of estimates of the K function for absorber catalogs. Thus we 
expect that with the SDSS absorber catalog with approximately 50,000 lines of sight, the 
standard errors of the estimates of /C will be roughly a factor of 50 smaller than the standard 
errors found in our simulation study with 1000 lines of sight. The actual increase in precision 
for a particular absorber catalog will of course depend on factors such as the actual spatial 
locations of the lines of sight and the density of the observed absorbers. 
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A. Appendix 

Here, we prove that the estimator ([7]) with weights and V given by ([8]) and Qj is 
unbiased. Let L represent the lines of sight, 5-Ba(x, h) denote a shell with center x, radius 
h and thickness A. We write (/i,7) for the polar coordinates of vector h, | ■ | for Euclidean 
distance, area or volume depending on the context, A = 7id'^\L\ for the volume probed by the 
lines of sight and 1l(x) for the indicator function, with 1l(x) = 1 if x G L and otherwise. 

Write /(x,y,z) = U(x)U(y)lL(z)l(o,iji](|y-x|)l(K2,i?3](|z-x|)ln(Zyxz)u;x(|y-x|, |z- 
x|,i7)V(|y — x|, |z — x|,fi)), where Q = [ai,a2] represents the range of angles between cti 
and a2- Then the estimator in (JTj) is X^x^y^^z '^)^ with 



Vx^^VT^z 



E\ ^ f{x,y,z)\ = \^ 1 1 1 f{x,x + h,x + k)g^^\x,x + h,x + k)dhdkdx 

"Ri 



= X' [ /YY / U(x)U(x+(/i,7))U(x+(fc,/?)) 

Jr3 Jo J R2 JSBo(0,k) J5Bo{Q,h) 

X 1^(7 - /5)cux(/i, fc, n)V{h, k, n)h^Pg^^\{h, 7), (fc, (3)) d-f d(3 dh dk c/x. 

In the first equality above, we have expressed g^^^ in terms of three vector quantities x, x + h 
and X + k. If stationarity is assumed, the specification of x in g^^^ is redundant. Thus we 
have removed the dependence on x in g^^^ in the next line. We have also expressed h and k 
in polar coordinates. Now, with the further assumption of isotropy, g^^\h,k) depends only 
on the direction of h relative to k (or vice versa). This simplifies the expression above, so 
that 

V /(x,y,z) = AM / / V{h,k,n)lL{^)uj^{h,k,n) 

x^y^z / -^^'-^O Jr2 

U(x+(A;,/5))fc2 



SBoiO,k) 

X / lQ{a)lL{'K + {h, (3 + a))h'^g^^\h, k,a) da d/3dhdkd:x., 

.JsBo{0,h) J 

where a denotes the angle on the sphere relative to {k,P). Under the assumption that g^^^ 
is slowly varying over Q, the expression in the square bracket above is equal to 

nd"^ 

{^)g^'\h,k,n) Yl ln(Zpx(x + k))^— , 

COS f/T-i 

pe5-Bo(x,h)nL ^ 

so that 



Vx^y^^^z 
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X 



/ lL(M+n)(x)lL(x+(/c,/3)) 

JdBo{0,k) 



X 



J2 lQ(Zpx(x + k)) 



cos 9^ 

p6<5Bo(x,/i)nL 



k'^dp 



dh dk dx 



X 



^ ^ In(Zpxq) — ^^^^ I dhdkdx. 

\peSBo{x,h)nL qeaBo(x,fe)nL 



cos 6'p cos 9q 



The above expression in the round brackets is the denominator of u!:x.{h, k,fl). Further 
simphfication yields 



eI J2 /(x,y,z)) = A^^ 



Ri /"-R3 



lL(/i,fc,J1)(x) 

US Jo jR^nd'^\L{h,k,n)\ 

Ri 



A'Kh^k'^S{n)g^^^ {h, k, n) dh dk dx 



^ X^A I I 47ih^k^S{n)g^^\h,k,n)dhdk 

Jo Jr2 

= X^A)C{(0,R^],{R2,R3],n). 
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